(57) Abstract:

PURPOSE: To make vocal training more active by storing voice
feature parameters of a model and of a vocal training person and
displaying these parameters in parallel at every training attempt.
CONSTITUTION: A vocal training device is constituted of a 
microphone 1 for inputting a vocal sound, an A/D converter 2 
for converting the inputted vocal sound into digital data, a voice 
analyzing circuit 3 for converting the digital data into a voice 
feature parameter time series, a memory writing control circuit 4
for the time series, a display part 14, a voice synthesizing circuit
11, a speaker 13, etc. A vocal sound of the vocal training person
which is inputted through said microphone 1 is converted into a 
voice feature parameter time series and the converted data are 
stored in a RAM 5 and displayed in parallel with the voice
feature of the model on the display part 14. The vocal training 
person repeats vocal training on the basis of the displayed 
information and corrects the accent, intonation and rhythm of his 
voice, so that he can grasp the degree of the training effect
himself.
A vocal training device for practicing vocal training by oneself based on model voice data
displayed as visible data or reproduced as a voice, comprising model voice data storing means which 
have stored a predetermined vocal text as model voice feature parameters; voice analyzing means for 
analyzing a feature of data of a voice uttered by a vocal training person; accumulating means for 
accumulating and storing voice feature parameters of the vocal training person, which are outputs 
from the voice analyzing means; feature parameter modifying means for modifying accumulated and 
stored voice feature parameters of the model voice data and the vocal training person's voice data; 
and voice synthesizing means for synthesizing a voice from accumulated and stored voice feature 
parameters of the model voice data and the vocal training person's voice data. 

Detailed Description of the Invention: 



(1) (Stage before listening to model voice data) 

For predetermined text items for vocal training, a vocal sound to be a model is analyzed, 
converted to a voice feature parameter time series and stored in the memory (ROM) 8. 

Various voice analysis/synthesis methods could be used here. This embodiment assumes the
commonly used PARCOR analysis/synthesis method.
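
The PARCOR (partial autocorrelation) analysis step can be illustrated with a short sketch. It assumes the autocorrelation method with the Levinson-Durbin recursion, which is one common way to obtain such coefficients. The function names, the frame handling, and the sign convention are illustrative assumptions and are not taken from the embodiment.

```python
def autocorrelation(frame, max_lag):
    """Return autocorrelation values r[0..max_lag] of one analysis frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def parcor_coefficients(frame, order):
    """Levinson-Durbin recursion (hypothetical helper).

    Returns (reflection coefficients k_1..k_order, residual energy).
    Sign conventions for PARCOR coefficients vary between texts; here
    the prediction polynomial is A(z) = 1 + a_1 z^-1 + ... + a_p z^-p.
    """
    r = autocorrelation(frame, order)
    if r[0] == 0.0:
        # Silent frame: nothing to predict.
        return [0.0] * order, 0.0
    a = [1.0] + [0.0] * order
    error = r[0]
    ks = []
    for m in range(1, order + 1):
        acc = r[m] + sum(a[i] * r[m - i] for i in range(1, m))
        k = -acc / error
        ks.append(k)
        prev = a[:]
        for i in range(1, m):
            a[i] = prev[i] + k * prev[m - i]
        a[m] = k
        error *= 1.0 - k * k
        if error <= 0.0:
            # Frame is perfectly predictable; pad the remaining orders.
            ks.extend([0.0] * (order - m))
            error = 0.0
            break
    return ks, error
```

Applying `parcor_coefficients` to successive frames of the digitized voice would give one possible form of the voice feature parameter time series.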

The memory reading control circuit 9 reads voice feature parameters from this memory
(ROM) 8 and outputs them to the voice synthesizing circuit 11 via the switching control circuit 10.
The voice synthesizing circuit 11 synthesizes and creates voice for items to train based on the voice
feature parameters, and reproduces it from the speaker 13 to have a vocal training person listen to it.

A voice feature parameter time series can also be modified to synthesize multiple voices, as
shown in Figure 2, for the vocal training person to listen to. The voice feature parameters to be
modified are the voice cord information parameter and the amplitude parameter.

The voice cord information parameter corresponds to the accent and intonation of an uttered
voice, and the amplitude parameter corresponds to the strength and weakness of the uttered voice.

(2) (Stage at which training is performed based on the model voice data, and the effect of the 
training is checked) 

Following instructions, the vocal training person practices against the synthesized model
voice. His voice is input from the microphone 1 to the voice analyzing circuit 3 via the A/D
converter 2 and analyzed with the PARCOR analysis/synthesis method described in (1). The
resulting voice feature parameter time series is stored in the memory (RAM) 5 via the memory
writing control circuit 4.

In this case, the voice features of the model voice and the vocal training person's voice are 
displayed in parallel on the display part 14, as shown in Figure 3. The voice cord information 
parameter time series and the amplitude parameter time series are used as the data to be displayed. 
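
The two displayed time series can be illustrated with a rough sketch. It assumes that the voice cord information is represented by a fundamental frequency estimated from the autocorrelation peak, and that the amplitude is the RMS level of each frame. The text does not state how either parameter is computed, so the function name, the frame length, the frequency range, and the voicing threshold are all hypothetical.

```python
import math


def frame_features(samples, sample_rate, frame_len,
                   min_f0=60.0, max_f0=400.0):
    """Return a list of (amplitude, pitch_hz) per non-overlapping frame.

    pitch_hz is 0.0 for frames judged unvoiced (assumed convention).
    """
    min_lag = int(sample_rate / max_f0)
    max_lag = int(sample_rate / min_f0)
    features = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame)
        amplitude = math.sqrt(energy / frame_len)  # RMS level
        best_lag, best_corr = 0, 0.0
        # Search for the lag with the strongest self-similarity.
        for lag in range(min_lag, min(max_lag, frame_len - 1) + 1):
            corr = sum(frame[i] * frame[i + lag]
                       for i in range(frame_len - lag))
            if corr > best_corr:
                best_lag, best_corr = lag, corr
        # Hypothetical voicing test: peak must be a fair share of energy.
        voiced = energy > 0.0 and best_corr > 0.3 * energy
        pitch = sample_rate / best_lag if voiced else 0.0
        features.append((amplitude, pitch))
    return features
```

Running this on both the model voice and the vocal training person's voice yields two pairs of series that could be plotted against a common time axis, in the manner of Figure 3.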

* * * *

In Figure 3, reference numerals (1), (1)', (2) and (2)' denote the amplitude information about
the model voice, the voice cord information about the model voice, the amplitude information about 
the vocal training person, and the voice cord information about the vocal training person, 
respectively. 

The vocal training person repeats vocal training based on the displayed information. For
example, by comparing the displayed information, he pays attention to accent and intonation
to bring the voice cord information (2)' close to the model voice, and pays attention to the
strength and weakness of his voice to bring the amplitude information (1)' close to the model
voice. The degree to which the accent, intonation, and strength and weakness should
specifically be modified can be found through the modification of each item of displayed
information, described below.

For example, to modify the voice cord information (2)' about the vocal training person, the
time T to be modified (all the horizontal axes in Figures 3 and 4 indicate time) and a
modification degree n are specified and input from the input part 15. The modification degree n
is quantized and corresponds to the displayed scale value. The parameter at the time T in the
voice cord information parameter time series is then modified by the modification degree n, as
shown in Figure 4.

In this case, the voice cord information parameters within a span t before and after the
time T are also corrected. Various methods could be used to correct this span around the
time T. This embodiment assumes linear correction, that is, the parameters are linearly
corrected.
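
One way to read the linear correction is sketched below. It assumes that the full correction n is applied at the time T and tapers linearly to zero at a distance t on either side. The text does not give the exact taper, so this shape, the function name, and the `step` argument (the hypothetical size of one scale unit in parameter units) are assumptions.

```python
def apply_linear_correction(series, T, n, t, step=1.0):
    """Return a copy of series corrected by n scale units around index T.

    series: parameter time series, one value per frame
    T:      frame index to be modified
    n:      quantized modification degree (may be negative)
    t:      number of frames before and after T that are also corrected
    step:   assumed size of one scale unit in parameter units
    """
    if not 0 <= T < len(series):
        raise IndexError("T is outside the time series")
    corrected = list(series)
    first = max(0, T - t)
    last = min(len(series) - 1, T + t)
    for i in range(first, last + 1):
        # Weight is 1 at T and falls linearly to 0 at T - t and T + t.
        weight = 1.0 - abs(i - T) / t if t > 0 else 1.0
        corrected[i] += n * step * weight
    return corrected
```

For example, `apply_linear_correction([0.0] * 7, 3, 2, 2)` raises the value at index 3 by 2 and its immediate neighbours by 1, leaving the rest unchanged.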

By performing the modification operation described above, it is possible to know to what 
degree the accent, intonation, and strength and weakness should be modified. 

* * * *

According to the present invention described above, there is an advantage that a vocal 
training person can perform vocal training appropriate for his ability by grasping the direction of the 
vocal training and the degree of the effect of the training himself.